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A Zinc Finger Protein Derived From Hematopoietic Cells 

FIELD OF INVENTION 

5 This invention relates to newly identified polynucleotides, polypeptides encoded by them 

and to the use of such polynucleotides and polypeptides, and to their production. More 
particularly, the polynucleotides and polypeptides of the present invention relate to the zinc finger 
protein gene family, hereinafter referred to as CBMACD04. The invention also relates to inhibiting 
or activating the action of such polynucleotides and polypeptides. 

10 

BACKGROUND OF THE INVENTION 

Proteins containing zinc finger motifs are often transcription factors or transcription repressors 
involved in cell division and proliferation.. A typical zinc finger motif is the C2H2 motif, which consists 
of repeated domains of two invariant cysteines and two histidines, with a loop of variable length, but 

1 5 containing an invariant leucine and either a tyrosine or phenylalanine located within each loop. Zinc 
finger proteins can bind directly to the DNA double helix. The transcription factor TFIIIA is one 
example of a C2H2 zinc finger protein, and the first identified C2H2 protein. The C2H2 zinc finger 
motif has also been identified in a number of other transcription factors, presumed transcription factors, 
and steroid receptors. This indicates that the zinc finger protein gene family has an established, proven 

20 history as therapeutic targets. Clearly there is a need for identification and characterization of further 
members of the zinc finger protein gene family which can play a role in preventing, ameliorating or 
correcting dysfunctions or diseases, including, but not limited to, cancer and AIDS. 

SUMMARY OF THE INVENTION 

25 In one aspect, the invention relates to CBMACD04 polypeptides and recombinant materials 

and methods for their production. Another aspect of the invention relates to methods for using such 
CBMACD04 polypeptides and polynucleotides. Such uses include the treatment of cancer and AIDS, 
among others. In still another aspect, the invention relates to methods to identify agonists and 
antagonists using the materials provided by the invention, and treating conditions associated with 

30 CBMACD04 imbalance with the identified compounds. Yet another aspect of the invention relates to 
diagnostic assays for detecting diseases associated with inappropriate CBMACD04 activity or levels. 
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DESCRIPTION OF THE INVENTION 
Definitions 

The following definitions are provided to facilitate understanding of certain terms used 
frequently herein. 

5 "CBMACD04" refers, among others, generally to a polypeptide having the amino acid 

sequence set forth in SEQ ID NO:2 or an allelic variant thereof. 

"CBMACD04 activity or CBMACD04 polypeptide activity" or "biological activity of the 
CBMACD04 or CBMACD04 polypeptide" refers to the metabolic or physiologic function of said 
CBMACD04 including similar activities or improved activities or these activities with decreased 
1 0 undesirable side-effects. Also included are antigenic and immunogenic activities of said 
CBMACD04. 

"CBMACD04 gene" refers to a polynucleotide having the nucleotide sequence set forth in 
SEQ ID NO: 1 or allelic variants thereof and/or their complements. 

"Antibodies" as used herein includes polyclonal and monoclonal antibodies, chimeric, 

1 5 single chain, and humanized antibodies, as well as Fab fragments, including the products of an Fab 
or other immunoglobulin expression library. 

"Isolated" means altered "by the hand of man" from the natural state. If an "isolated" 
composition or substance occurs in nature, it has been changed or removed from its original 
environment, or both. For example, a polynucleotide or a polypeptide naturally present in a living 

20 animal is not "isolated," but the same polynucleotide or polypeptide separated from the coexisting 
materials of its natural state is "isolated", as the term is employed herein. < : 

"Polynucleotide" generally refers to any polyribonucleotide or polydeoxribonucleotide, 
which may be unmodified RNA or DNA or modified RNA or DNA. "Polynucleotides" include, 
without limitation single- and double-stranded DNA, DNA that is a mixture of single- and double- 

25 stranded regions, single- and double-stranded RNA, and RNA that is mixture of single- and 

double-stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded 
or, more typically, double-stranded or a mixture of single- and double-stranded regions. In 
addition, "polynucleotide" refers to triple-stranded regions comprising RNA or DNA or both RNA 
and DNA. The term polynucleotide also includes DNAs or RNAs containing one or more modified 

30 bases and DNAs or RNAs with backbones modified for stability or for other reasons. "Modified" 
bases include, for example, tritylated bases and unusual bases such as inosine. A variety of 
modifications has been made to DNA and RNA; thus, "polynucleotide" embraces chemically, 
enzymatically or metabolically modified forms of polynucleotides as typically found in nature, as 



2 
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well as the chemical forms of DNA and RNA characteristic of viruses and cells. "Polynucleotide" 
also embraces relatively short polynucleotides, often referred to as oligonucleotides. 

"Polypeptide" refers to any peptide or protein comprising two or more amino acids joined 
to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. "Polypeptide" 
5 refers to both short chains, commonly referred to as peptides, oligopeptides or oligomers, and to 
longer chains, generally referred to as proteins. Polypeptides may contain amino acids other than 
the 20 gene-encoded amino acids. "Polypeptides" include amino acid sequences modified either by 
natural processes, such as posttranslational processing, or by chemical modification techniques 
which are well known in the art. Such modifications are well described in basic texts and in more 

1 0 detailed monographs, as well as in a voluminous research literature. Modifications can occur 
anywhere in a polypeptide, including the peptide backbone, the amino acid side-chains and the 
amino or carboxyl termini. It will be appreciated that the same type of modification may be 
present in the same or varying degrees at several sites in a given polypeptide. Also, a given 
polypeptide may contain many types of modifications. Polypeptides may be branched as a result of 

1 5 ubiquitination, and they may be cyclic, with or without branching. Cyclic, branched and branched 
cyclic polypeptides may result from posttranslation natural processes or may be made by synthetic 
methods. Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or 
nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of 

20 phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation, formation 
of covalent cross-links, formation of cystine, formation of pyroglutamate, formylation, gammar 
carboxylation, glycosylation, GPI anchor formation, hydroxylation, iodination, methylation, 
myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation, racemization, 
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins such as 

25 arginylation, and ubiquitination. See, for instance, PROTEINS - STRUCTURE AND 

MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. H. Freeman and Company, New 
York, 1993 and Wold, F., Posttranslational Protein Modifications: Perspectives and Prospects, 
pgs. 1-12 in POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. 
Johnson, Ed., Academic Press, New York, 1983; Seifter et al 9 "Analysis for protein modifications 

30 and nonprotein cofactors", Meth Enzymol (1990) 182:626-646 and Rattan et aL, "Protein 
Synthesis: Posttranslational Modifications and Aging", Ann NY Acad Sci (1992) 663:48-62. 

"Variant" as the term is used herein, is a polynucleotide or polypeptide that differs from a 
reference polynucleotide or polypeptide respectively, but retains essential properties. A typical 
variant of a polynucleotide differs in nucleotide sequence from another, reference polynucleotide. 

3 
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Changes in the nucleotide sequence of the variant may or may not alter the amino acid sequence of 
a polypeptide encoded by the reference polynucleotide. Nucleotide changes may result in amino 
acid substitutions, additions, deletions, fusions and truncations in the polypeptide encoded by the 
reference sequence, as discussed below. A typical variant of a polypeptide differs in amino acid 
sequence from another, reference polypeptide. Generally, differences are limited so that the 
sequences of the reference polypeptide and the variant are closely similar overall and, in many 
regions, identical. A variant and reference polypeptide may differ in amino acid sequence by one or 
more substitutions, additions, deletions in any combination. A substituted or inserted amino acid 
residue may or may not be one encoded by the genetic code. A variant of a polynucleotide or 
polypeptide may be a naturally occurring such as an allelic variant, or it may be a variant that is 
not known to occur naturally. Non-naturally occurring variants of polynucleotides and 
polypeptides may be made by mutagenesis techniques or by direct synthesis. 

"Identity," as known in the art, is a relationship between two or more polypeptide sequences 
or two or more polynucleotide sequences, as determined by comparing the sequences. In the art, 
"identity" also means the degree of sequence relatedness between polypeptide or polynucleotide 
sequences, as the case may be, as determined by the match between strings of such sequences. 

"Identity" and "similarity" can be readily calculated by known methods, including but 
not limited to those described in (Computational Molecular Biology , Lesk, A.M., ed., Oxford 
University Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, 
D.W., ed., Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, 
Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence Analysis , 
in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, 
Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; and Carillo, H., 
and Lipman, D., SIAM J. Applied Math., 48: 1073 (1988). Preferred methods to determine 
identity are designed to give the largest match between the sequences tested. Methods to 
determine identity and similarity are codified in publicly available computer programs. 
Preferred computer program methods to determine identity and similarity between two 
sequences include, but are not limited to, the GCG program package (Devereux, J., et al., 
Nucleic Acids Research 12(1): 387 (1984)), BLASTP, BLASTN, and FASTA (Atschul, S.F. et 
al., J. Molec. BioL 215: 403-410 (1990). The BLAST X program is publicly available from 
NCBI and other sources (BLAST Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, MD 
20894; Altschul, S., et al., J. Mol. Biol. 215: 403-410 (1990). The well known Smith 
Waterman algorithm may also be used to determine identity. 
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Preferred parameters for polypeptide sequence comparison include the following: 
1) Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
Comparison matrix: BLOSSUM62 from Hentikoff and Hentikoff, Proc. Natl. Acad. Sci. 
USA. 89:10915-10919 (1992) 
5 Gap Penalty: 12 

Gap Length Penalty: 4 

A program useful with these parameters is publicly available as the "gap" program 
from Genetics Computer Group, Madison WI. The aforementioned parameters are the default 
parameters for polypeptide comparisons (along with no penalty for end gaps). 

10 

Preferred parameters for polynucleotide comparison include the following: 
1) Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
Comparison matrix: matches = +10, mismatch = 0 
Gap Penalty: 50 
15 Gap Length Penalty: 3 

A program useful with these parameters is publicly available as the "gap" program 
from Genetics Computer Group, Madison WL The aforementioned parameters are the default 
parameters for polynucleotide comparisons. 

20 Preferred polynucleotide embodiments further include an isolated polynucleotide 

comprising a polynucleotide having at least a 50,60, 70, 80, 85, 90, 95, 97 qr 100% . identity to 
a polynucleotide reference sequence of SEQ ID NO:l, wherein said reference sequence may be 
identical to the sequence of SEQ ID NO: 1 or may include up to a certain integer number of 
nucleotide alterations as compared to the reference sequence, wherein said alterations are 

25 selected from the group consisting of at least one nucleotide deletion, substitution, including 
transition and transversion, or insertion, and wherein said alterations may occur at the 5' or 3' 
terminal positions of the reference nucleotide sequence or anywhere between those terminal 
positions, interspersed either individually among the nucleotides in the reference sequence or in 
one or more contiguous groups within the reference sequence, and wherein said number of 

30 nucleotide alterations is determined by multiplying the total number of nucleotides in SEQ ID 
NO:l by the numerical percent of the respective percent identity and subtracting that product 
from said total number of nucleotides in SEQ ID NO:l, or: 

n n < x n - (x n • y), 

5 
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wherein n n is the number of nucleotide alterations, x n is the total number of nucleotides in 
SEQ ID NO:l, and y is 0.50 for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 0.85 for 
85%, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and wherein any non- 
integer product of x n and y is rounded down to the nearest integer prior to subtracting it from 
x n . Alterations of a polynucleotide sequence encoding the polypeptide of SEQ ID NO:2 may 
create nonsense, missense or frameshift mutations in this coding sequence and thereby alter the 
polypeptide encoded by the polynucleotide following such alterations. 

Preferred polypeptide embodiments further include an isolated polypeptide comprising 
a polypeptide having at least a 50,60, 70, 80, 85, 90, 95, 97 or 100% identity to a polypeptide 
reference sequence of SEQ ID NO:2, wherein said reference sequence may be identical to the 
sequence of SEQ ID NO: 2 or may include up to a certain integer number of amino acid 
alterations as compared to the reference sequence, wherein said alterations are selected from 
the group consisting of at least one amino acid deletion, substitution, including conservative 
and non-conservative substitution, or insertion, and wherein said alterations may occur at the 
amino- or carboxy-terminal positions of the reference polypeptide sequence or anywhere 
between those terminal positions, interspersed either individually among the amino acids in the 
reference sequence or in one or more contiguous groups within the reference sequence, and 
wherein said number of amino acid alterations is determined by multiplying the total number of 
amino acids in SEQ ID NO:2 by the numerical percent of the respective percent identity and ... 
subtracting that product from said total number of amino acids in SEQ ID NO:2, or: 

n a <x a -(x a #y), 

wherein n a is the number of amino acid alterations, x a is the total number of amino 
acids in SEQ ID NO:2, and y is 0.50 for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 
0.85 for 85%, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and wherein any 
non-integer product of x a and y is rounded down to the nearest integer prior to subtracting it 
from x a . 

Polypeptides of the Invention 

In one aspect, the present invention relates to CBMACD04 polypeptides (or CBMACD04 
proteins). The CBMACD04 polypeptides include the polypeptide of SEQ ID NO:2; as well as 

6 
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polypeptides comprising the amino acid sequence of SEQ ID NO: 2; and polypeptides comprising 
the amino acid sequence which have at least 80% identity to that of SEQ ID NO:2 over its entire 
length, and still more preferably at least 90% identity, and even still more preferably at least 95% 
identity to SEQ ID NO: 2. Furthermore, those with at least 97-99% are highly preferred. Also 
5 included within CBMACD04 polypeptides are polypeptides having the amino acid sequence which 
have at least 80% identity to the polypeptide having the amino acid sequence of SEQ ID NO .2 over 
its entire length, and still more preferably at least 90% identity, and still more preferably at least 
95% identity to SEQ ID NO:2. Furthermore, those with at least 97-99% are highly preferred. 
Preferably CBMACD04 polypeptide exhibit at least one biological activity of CBMACD04. 
10 The CBMACD04 polypeptides may be in the form of the "mature" protein or may be a part 

of a larger protein such as a fusion protein. It is often advantageous to include an additional amino 
acid sequence which contains secretory or leader sequences, pro-sequences, sequences which aid in 
purification such as multiple histidine residues, or an additional sequence for stability during 
recombinant production. 

1 5 Fragments of the CBMACD04 polypeptides are also included in the invention. A fragment 

is a polypeptide having an amino acid sequence that entirely is the same as part, but not all, of the amino 
acid sequence of the aforementioned CBMACD04 polypeptides. As with CBMACD04 polypeptides, 
fragments may be "free-standing," or comprised within a larger polypeptide of which they form a part or 
region, most preferably as a single continuous region. Representative examples of polypeptide fragments 

20 of the invention, include, for example, fragments from about amino acid number 1-20, 2 1-40, 41-60, 61- 
80, 8 1-100, and 101 to the end of CBMACD04 polypeptide. In this context "about". includes the;W 
particularly recited ranges larger or smaller by several, 5, 4, 3, 2 or 1 amino acid at either extreme or at 
both extremes. 

Preferred fragments include, for example, truncation polypeptides having the amino acid 
25 sequence of CBMACD04 polypeptides, except for deletion of a continuous series of residues that 

includes the amino terminus, or a continuous series of residues that includes the carboxyl terminus or 
deletion of two continuous series of residues, one including the amino terminus and one including the 
carboxyl terminus. Also preferred are fragments characterized by structural or functional attributes such 
as fragments that comprise alpha-helix and alpha-helix forming regions, beta-sheet and beta-sheet- 
30 forming regions, turn and turn-forming regions, coil and coil-forming regions, hydrophilic regions, 

hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, surface- 
forming regions, substrate binding region, and high antigenic index regions. Other preferred fragments 
are biologically active fragments. Biologically active fragments are those that mediate CBMACD04 
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activity, including those with a similar activity or an improved activity, or with a decreased undesirable 
activity. Also included are those that are antigenic or immunogenic in an animal, especially in a human. 

Preferably, all of these polypeptide fragments retain the biological activity of the CBMACD04, 
including antigenic activity. Variants of the defined sequence and fragments also form part of the present 
5 invention. Preferred variants are those that vary from the referents by conservative amino acid 

substitutions — i.e., those that substitute a residue with another of like characteristics. Typical such 
substitutions are among Ala, Val, Leu and He; among Ser and Thr; among the acidic residues Asp and 
Glu; among Asn and Gin; and among the basic residues Lys and Arg; or aromatic residues Phe and Tyr. 
Particularly preferred are variants in which several, 5-10, 1-5, or 1-2 amino acids are substituted, 
1 0 deleted, or added in any combination. 

The CBMACD04 polypeptides of the invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced polypeptides, 
synthetically produced polypeptides, or polypeptides produced by a combination of these methods. 
Means for preparing such polypeptides are well understood in the art. 

15 

Polynucleotides of the Invention 

Another aspect of the invention relates to CBMACD04 polynucleotides. CBMACD04 
polynucleotides include isolated polynucleotides which encode the CBMACD04 polypeptides and 
fragments, and polynucleotides closely related thereto. More specifically, CBMACD04 polynucleotide 

20 of the invention include a polynucleotide comprising the nucleotide sequence contained in SEQ ID NO: 1 
encoding a CBMACD04 polypeptide of SEQ ID NO: 2, and polynucleotide having the particular 
sequence of SEQ ID NO: 1 . CBMACD04 polynucleotides further include a polynucleotide comprising a 
nucleotide sequence that has at least 80% identity over its entire length to a nucleotide sequence encoding 
the CBMACD04 polypeptide of SEQ ID NO:2, and a polynucleotide comprising a nucleotide 

25 sequence that is at least 80% identical to of SEQ ID NO: 1 over its entire length. In this regard, 
polynucleotides at least 90% identical are particularly preferred, and those with at least 95% are 
especially preferred. Furthermore, those with at least 97% are highly preferred and those with at least 
98-99% are most highly preferred, with at least 99% being the most preferred. Also included under 
CBMACD04 polynucleotides are a nucleotide sequence which has sufficient identity to a nucleotide 

30 sequence contained in SEQ ID NO:l to hybridize under conditions useable for amplification or for 
use as a probe or marker. The invention also provides polynucleotides which are complementary to 
such CBMACD04 polynucleotides. 

CBMACD04 of the invention is structurally related to other proteins of the zinc finger protein 
gene family, as shown by the results of sequencing the cDNA of Table 1 (SEQ ID NO: 1) encoding 

8 
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human CBMACD04. The cDNA sequence of SEQ ID NO: 1 contains an open reading frame 
(nucleotide number 135 to 1 193) encoding a polypeptide of 353 amino acids of SEQ ID NO:2. The 
amino acid sequence of Table 2 (SEQ ID NO:2) has about 79.2% identity (using FASTA) in 35 1 amino 
acid residues with Human Kruppel related zinc finger protein HTF10 (J. Bellefroid et al., Proc. Natl. 
5 Acad. Sci. U.S.A. 88 (9), 3608-3612,1991). The nucleotide sequence of Table 1 (SEQ ID NO:l) has 
about 69% identity (using FASTA) in 1619 nucleotide residues with Human Kruppel related zinc finger 
protein HTF10 (E.J. Bellefroid et al., Proc. Natl. Acad. Sci. U.S.A. 88 (9), 3608-3612,1991). Thus, 
CBMACD04 polypeptides and polynucleotides of the present invention are expected to have, inter alia, 
similar biological functions/properties to their homologous polypeptides and polynucleotides, and their 
1 0 utility is obvious to anyone skilled in the art. 



Table T 



1 


GGGGATATGG 


CGGGGCCTTT 


GTCTCTCGCT 


GTCGCCGGAG 


TCCCAGGTCT 




5 1 


G 1 C I I L. AC I G 


CTCTGTGTCC 


TCTGCTCCTA 


GAGGCCCAGC 


CTCTGTGGCG 




1U1 




LAb 1A1 1 Go A 


GA1 CCAL-AGC 


IMbAIbLLA 


r+c*7\ /** /^t* 7\ 
GGAL.CC-CCTA 










± ± 1 i in 




V-M. i lit 




201 


TCTCTGGAGG 


AGTGGCAACA 


CCTGGACATT 


G C AC AGCAG A 


ATTTATATAG 




251 


AAATGTGATG 


TTAGAGAACT 


ACAGAAACCT 


GGCCTTCCTG 


GGTATTGCTG 




301 


TCTCTAAGCC 


AGACCTGATC 


AC C TGTCTGG 


AACAAGGGAA 


AGAGCCCTGG 




351 


AATATGAAGC 


GACATGAGAT 


GGTGGATGAA 


CCCCCAGGTA 


TGTGTCCTCA 




401 


TTTTGCTCAA 


GACCTTTGGC 


CAGAGCAGGG 


CATGGAAGAT 


TCTTTTCAAA 




451 


AAGCAATACT 


GAGAAGATAT 


GGAAAATATG 


GACATGAGAA 


TTTACAGTTA 




501 


AG AAAAGG CT 


GTAAAAGTGT 


GGATGAGTAT 


AAGGTGAACA 


AAGAAGGTTA 




551 


TAATGGACTT 


AACCAGTGTT 


TCACAACTGC 


CCAGAGCAAA 


GTATTTCAAT 




601 


G TG AT AAATA 


TTTGAAAGTC 


TTCTATAAAT 


TTTTAAATTC 


AAACAGACCT 




651 


AAGATAAGAC 


ATACTGAAAA 


GAAATCTTTC 


AAATGTAAAA 


AACGTGTCAA 
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701 


ATT ATTTTG C 


ATGCTTTCAC 


ATAAAACCCA 


ACACAAAAGC 


ATTTATCATA 


751 


GAGAGAAGTC 


CTACAAATGT 


AAAGAATGTG 


GAAAAACCTT 


TAATTGGTCC 


801 


TCAACCCTTA 


CTAATCATAG 


GAAAATTTAT 


ACTGAAGAGA 


AACCTTACAA 


851 


ATGTGAAGAA 


TATAACAAAT 


CTCCTAAGCA 


ACTCTCAACC 


CTTACTACAC 


901 . 


ATGAAATAAT 


TCATGCTGGA 


GAGAAACTCT 


ACAAATGTGA 


AG AATGTGG C 


951 


AAAGCTTTTA 


ATCGGTCCTC 


AACTTTTACT 


AAACATAAGG 


TAATTCATAC 


1001 


TGGAGTAAAA 


CCCTACAAAT 


GTGAAGAATG 


TGGCAAAGCA 


TTTTTCTGGT 


1051 


CCTCAACCCT 


AACTAAACAT 


AAGAGAATTC 


ATACTGGAGA 


GCAACCCTAC 


1101 


AAATGGGAAA 


AATTTGGCAA 


AGCCTTTAAT 


CGGTCCTCGC 


ACCTCACCAC 


1151 


AGATAAGATA 


ACTCATACTG 


GAGAGAAATC 


TTACAAGTAT 


GAATAATGTG 


1201 


CCAAAGCCTA 


AGAAAAC CCT 


CAATTCTTAA 


TAGATATAAG 


ATTATTCCTA 


1251 


CTGGAGAGAA 


ACTACAAACC 


TGAGAGAGGC 


GCTAATGCTT 


TTGACAGTAC 


1301 


CTAAAACTTT 


AAAGAAAATC 


ATTCTGCTGA 


AAAATCCTAG 


AAATGTGAAG 


1351 


AATGTGAAAA 


AGCCTTTAAA 


TGATTGTCAC 


ACTTGATTGT 


AGGTAAGATA 


1401 


ATTCATACTG 


GAGAAAACTA 


CCAGTGTGAA 


CAACGTGGCC 


AAGCTTCGAC 


1451 


AATGCTCACA 


CCCTATTGCA 


CAGGAAAGCA 


TTTATACTTG 


AGAAGAAATG 


1501 


TACAAATATT 


GGCAAAGTAA 


AAAATCCATT 


AACACCTGCT 


CACATCTTAC 


1551 


TCAAAATTGT 


AGAGTTCATA 


GTAAATAAAA 


GCATTAAAAT 


TCAAAAAAAA 


1601 


AAAAAAAAAA 


AAAAAAAAA 









4 A nucleotide sequence of a human CBMACD04 (SEQ ID NO: 1). 



Table 2 b 

1 MPGPPRSLEM GLLTFRDVA I EFSLEEWQHL DIAQQNLYRN VMLENYRNLA 
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51 


FLGIAVSKPD 


LITCLEQGKE 


PWNMKRHEMV 


DEPPGMCPHF 


AQDLWPEQGM 


101 


EDSFQKAILR 


RYGKYGHENL 


QLRKGCKSVD 


EYKVNKEGYN 


GLNQCFTTAQ 


151 


SKVFQCDKYL 


KVFYKFLNSN 


RPKIRHTEKK 


SFKCKKRVKL 


FCMLSHKTQH 


201 


KSIYHREKSY 


KCKECGKTFN 


WSSTLTNHRK 


IYTEEKPYKC 


EEYNKSPKQL 


251 


STLTTHEIIH 


AGEKLYKCEE 


CGKAFNRSST 


FTKHKVIHTG 


VKPYKCEECG 


301 


KAFFWSSTLT 


KHKRIHTGEQ 


PYKWEKFGKA 


FNRS SHLTTD 


KITHTGEKSY 


351 


KYE 











b An amino acid sequence of a human CBMACD04 (SEQ ID NO: 2). 



One polynucleotide of the present invention encoding CBMACD04 may be obtained using 
5 standard cloning and screening, from a cDNA library derived from mRNA in cells of human cord blood 
using the expressed sequence tag (EST) analysis (Adams, M.D., et al Science (1991) 252:1651- 
1656; Adams, M.D. et al, Nature, (1992) 555:632-634; Adams, M.D., et al, Nature (1995) 377 
Supp:3-174). Polynucleotides of the invention can also be obtained from natural sources such as 
genomic DNA libraries or can be synthesized using well known and commercially available 
10 techniques. 

The nucleotide sequence encoding CBMACD04 polypeptide of SEQ ID NO:2 may be 
identical to the polypeptide encoding sequence contained in Table 1 (nucleotide number 135 to 1 193 
of SEQ ID NO: 1), or it may be a sequence, which as a result of the redundancy (degeneracy) of the 
genetic code, also encodes the polypeptide of SEQ ID NO:2. 

1 5 When the polynucleotides of the invention are used for the recombinant production of 

CBMACD04 polypeptide, the polynucleotide may include the coding sequence for the mature 
polypeptide or a fragment thereof, by itself; the coding sequence for the mature polypeptide or fragment 
in reading frame with other coding sequences, such as those encoding a leader or secretory sequence, a 
pre-, or pro- or prepro- protein sequence, or other fusion peptide portions. For example, a marker 

20 sequence which facilitates purification of the fused polypeptide can be encoded. In certain preferred 

embodiments of this aspect of the invention, the marker sequence is a hexa-histidine peptide, as provided 
in the pQE vector (Qiagen, Inc.) and described in Gentz et al. , Proc Natl Acad Sci USA (1989) 86:82 1 - 
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824, or is an HA tag. The polynucleotide may also contain non-coding 5' and 3' sequences, such as 
transcribed, non-translated sequences, splicing and polyadenylation signals, ribosome binding sites and 
sequences that stabilize mRNA. 

Further preferred embodiments are polynucleotides encoding CBMACD04 variants comprise 
5 the amino acid sequence CBMACD04 polypeptide of Table 2 (SEQ ID NO:2) in which several, 5-10, 
1-5, 1-3, 1-2 or 1 amino acid residues are substituted, deleted or added, in any combination. 

The present invention further relates to polynucleotides that hybridize to the herein above- 
described sequences. In this regard, the present invention especially relates to polynucleotides which 
hybridize under stringent conditions to the herein above-described polynucleotides. As herein used, the 
1 0 term "stringent conditions" means hybridization will occur only if there is at least 80%, and preferably at 
least 90%, and more preferably at least 95%, yet even more preferably 97-99% identity between the 
sequences. 

Polynucleotides of the invention, which are identical or sufficiently identical to a nucleotide 
sequence contained in SEQ ID NO: 1 or a fragment thereof, may be used as hybridization probes for 
1 5 cDNA and genomic DNA, to isolate full-length cDNAs and genomic clones encoding CBMACD04 

polypeptide and to isolate cDN A and genomic clones of other genes (including genes encoding homologs 
and orthologs from species other than human) that have a high sequence similarity to the CBMACD04 
gene. Such hybridization techniques are known to those of skill in the art. Typically these nucleotide 
sequences are 80% identical, preferably 90% identical, more preferably 95% identical to that of the 
20 referent. The probes generally will comprise at least 15 nucleotides. Preferably, such probes will have at 
... .least 30 nucleotides and may have at least SO.nucleotides. Particularly preferred probes will range ,. 
between 30 and 50 nucleotides. 

In one embodiment, to obtain a polynucleotide encoding CBMACD04 polypeptide, including 
homologs and orthologs from species other than human, comprises the steps of screening an appropriate 
25 library under stingent hybridization conditions with a labeled probe having the SEQ ID NO: 1 or a 
fragment thereof; and isolating full-length cDNA and genomic clones containing said polynucleotide 
sequence. Thus in another aspect, CBMACD04 polynucleotides of the present invention further include 
a nucleotide sequence comprising a nucleotide sequence that hybridize under stringent condition to a 
nucleotide sequence having SEQ ID NO: 1 or a fragment thereof Also included with CBMACD04 
30 polypeptides are polypeptide comprising amino acid sequence encoded by nucleotide sequence obtained 
by the above hybridization condition. Such hybridization techniques are well known to those of skill in 
the art. Stringent hybridization conditions are as defined above or, alternatively, conditions under 
overnight incubation at 42°C in a solution comprising: 50% formamide, 5xSSC (150mM NaCl, 15mM 
trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's solution, 10 % dextran sulfate, and 

12 
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20 microgram/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 0. lx SSC at 
about 65°C 

The polynucleotides and polypeptides of the present invention may be employed as research 
reagents and materials for discovery of treatments and diagnostics to animal and human disease. 

5 

Vectors, Host Cells, Expression 

The present invention also relates to vectors which comprise a polynucleotide or polynucleotides 
of the present invention, and host cells which are genetically engineered with vectors of the invention and 
to the production of polypeptides of the invention by recombinant techniques. Cell-free translation 
1 0 systems can also be employed to produce such proteins using RNAs derived from the DNA constructs of 
the present invention. 

For recombinant production, host cells can be genetically engineered to incorporate expression 
systems or portions thereof for polynucleotides of the present invention. Introduction of polynucleotides 
into host cells can be effected by methods described in many standard laboratory manuals, such as Davis 

15 et aL , BASIC METHODS IN MOLECULAR BIOLOGY (1 986) and Sambrook et al., MOLECULAR 
CLONING: A LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y. (1989) such as calcium phosphate transfection, DEAE-dextran mediated transfection, 
transvection, microinjection, cationic lipid-mediated transfection, electroporation, transduction, scrape 
loading, ballistic introduction or infection. 

20 Representative examples of appropriate hosts include bacterial cells, such as streptococci, 

staphylococci, E co//^ 

Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as 
CHO, COS, HeLa, C127, 3T3, BHK, HEK 293 and Bowes melanoma cells; and plant cells. 

A great variety of expression systems can be used. Such systems include, among others, 

25 chromosomal, episomal and virus-derived systems, e.g., vectors derived from bacterial plasmids, from 

bacteriophage, from transposons, from yeast episomes, from insertion elements, from yeast chromosomal 
elements, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia viruses, 
adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from 
combinations thereof, such as those derived from plasmid and bacteriophage genetic elements, such as 

30 cosmids and phagemids. The expression systems may contain control regions that regulate as well as 
engender expression. Generally, any system or vector suitable to maintain, propagate or express 
polynucleotides to produce a polypeptide in a host may be used. The appropriate nucleotide sequence 
may be inserted into an expression system by any of a variety of well-known and routine techniques, 

13 
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such as, for example, those set forth in Sambrook et al., MOLECULAR CLONING, A LABORATORY 
MANUAL (supra). 

For secretion of the translated protein into the lumen of the endoplasmic reticulum, into the 
periplasmic space or into the extracellular environment, appropriate secretion signals may be 
5 incorporated into the desired polypeptide. These signals may be endogenous to the polypeptide or they 
may be heterologous signals. 

If the CBMACD04 polypeptide is to be expressed for use in screening assays, generally, it is 
preferred that the polypeptide be produced at the surface of the cell. In this event, the cells may be 
harvested prior to use in the screening assay. If CBMACD04 polypeptide is secreted into the 
1 0 medium, the medium can be recovered in order to recover and purify the polypeptide; if produced 
intracellular^, the cells must first be lysed before the polypeptide is recovered. 
CBMACD04 polypeptides can be recovered and purified from recombinant cell cultures by well-known 
methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
1 5 chromatography, hydroxylapatite chromatography and lectin chromatography. Most preferably, high 
performance liquid chromatography is employed for purification. Well known techniques for refolding 
proteins may be employed to regenerate active conformation when the polypeptide is denatured during 
isolation and or purification. 

Diagnostic Assays 

This invention also relates to the use of CBMACD04 polynucleotides for use as diagnostic 
reagents. Detection of a mutated form of CBMACD04 gene associated with a dysfunction will provide a 
diagnostic tool that can add to or define a diagnosis of a disease or susceptibility to a disease which 
results from under-expression, over-expression or altered expression of CBMACD04. Individuals 
carrying mutations in the CBMACD04 gene may be detected at the DNA level by a variety of 
techniques. 

Nucleic acids for diagnosis may be obtained from a subject's cells, such as from blood, urine, 
saliva, tissue biopsy or autopsy material. The genomic DNA may be used directly for detection or may 
be amplified enzymatically by using PCR or other amplification techniques prior to analysis. RNA or 
cDNA may also be used in similar fashion. Deletions and insertions can be detected by a change in size 
of the amplified product in comparison to the normal genotype. Point mutations can be identified by 
hybridizing amplified DNA to labeled CBMACD04 nucleotide sequences. Perfectly matched sequences 
can be distinguished from mismatched duplexes by RNase digestion or by differences in melting 
temperatures. DNA sequence differences may also be detected by alterations in electrophoretic mobility 

14 
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of DNA fragments in gels, with or without denaturing agents, or by direct DNA sequencing. See, e.g., 
Myers et aL, Science (1985) 230:1242. Sequence changes at specific locations may also be revealed by 
nuclease protection assays, such as RNase and S 1 protection or the chemical cleavage method. See 
Cotton et al , Proc Natl Acad Sci USA ( 1 985) 85 : 4397-440 1 . In another embodiment, an array of 
5 oligonucleotides probes comprising CBMACD04 nucleotide sequence or fragments thereof can be 

constructed to conduct efficient screening of e.g., genetic mutations. Array technology methods are well 
known and have general applicability and can be used to address a variety of questions in molecular 
genetics including gene expression, genetic linkage, and genetic variability. (See for example: M.Chee et 
al., Science, Vol 274, pp 610-613 (1996)). 

1 0 The diagnostic assays offer a process for diagnosing or determining a susceptibility to cancer 

and AIDS through detection of mutation in the CBMACD04 gene by the methods described. 

In addition, cancer and AIDS, can be diagnosed by methods comprising determining from a 
sample derived from a subject an abnormally decreased or increased level of CBMACD04 
polypeptide or CBMACD04 mRNA. Decreased or increased expression can be measured at the 

1 5 RNA level using any of the methods well known in the art for the quantitation of polynucleotides, 
such as, for example, PCR, RT-PCR, RNase protection, Northern blotting and other hybridization 
methods. Assay techniques that can be used to determine levels of a protein, such as an CBMACD04 
polypeptide, in a sample derived from a host are well-known to those of skill in the art. Such assay 
methods include radioimmunoassays, competitive-binding assays, Western Blot analysis and ELISA 

20 assays. 

Thus in another aspect, the present invention relates to a diagonostic kit for a diseasgjor 
suspectability to a disease, particularly cancer and AIDS, which comprises: 
(a) a CBMACD04 polynucleotide, preferably the nucleotide sequence of SEQ ID NO: 1, or a 
fragment thereof; 
25 (b) a nucleotide sequence complementary to that of (a); 

(c) a CBMACD04 polypeptide, preferably the polypeptide of SEQ ID NO: 2, or a fragment 
thereof; or 

(d) an antibody to a CBMACD04 polypeptide, preferably to the polypeptide of SEQ ID NO: 2. 
It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial 

30 component. 



Chromosome Assays 

The nucleotide sequences of the present invention are also valuable for chromosome 
identification. The sequence is specifically targeted to and can hybridize with a particular location on an 

15 
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individual human chromosome. The mapping of relevant sequences to chromosomes according to the 
present invention is an important first step in correlating those sequences with gene associated disease. 
Once a sequence has been mapped to a precise chromosomal location, the physical position of the 
sequence on the chromosome can be correlated with genetic map data. Such data are found, for 
5 example, in V. McKusick, Mendelian Inheritance in Man (available on line through Johns Hopkins 

University Welch Medical Library). The relationship between genes and diseases that have been mapped 
to the same chromosomal region are then identified through linkage analysis (coinheritance of physically 
adjacent genes). 

The differences in the cDNA or genomic sequence between affected and unaffected 
10 individuals can also be determined. If a mutation is observed in some or all of the affected 

individuals but not in any normal individuals, then the mutation is likely to be the causative agent 
of the disease. 



1 5 Antibodies 

The polypeptides of the invention or their fragments or analogs thereof, or cells expressing them 
can also be used as immunogens to produce antibodies immunospecific for the CBMACD04 
polypeptides. The term "immunospecific" means that the antibodies have substantiall greater affinity for 
the polypeptides of the invention than their affinity for other related polypeptides in the prior art. 

20 Antibodies generated against the CBMACD04 polypeptides can be obtained by administering 

- the polypeptides or epitope-bearing fragments, analogs or cells to an animal, preferably a nonhuman, 
using routine protocols. For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples include the hybridoma 
technique (Kohler, G. and Milstein, C, Nature (1975) 256:495-497), the trioma technique, the human 

2 5 B-cell hybridoma technique (Kozbor et al , Immunology Today ( 1 983) 4:72) and the EBV-hybridoma 
technique (Cole et al % MONOCLONAL ANTIBODIES AND CANCER THERAPY, pp. 77-96, Alan 
R. Liss, Inc., 1985). 

Techniques for the production of single chain antibodies (U.S. Patent No. 4,946,778) can also 
be adapted to produce single chain antibodies to polypeptides of this invention. Also, transgenic mice, or 
30 other organisms including other mammals, may be used to express humanized antibodies. 

The above-described antibodies may be employed to isolate or to identify clones expressing the 
polypeptide or to purify the polypeptides by affinity chromatography. 

Antibodies against CBMACD04 polypeptides may also be employed to treat cancer and AIDS, 
among others. 

16 
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Vaccines 

Another aspect of the invention relates to a method for inducing an immunological 
response in a mammal which comprises inoculating the mammal with CBMACD04 polypeptide, or 
5 a fragment thereof, adequate to produce antibody and/or T cell immune response to protect said 
animal from cancer and AIDS, among others. Yet another aspect of the invention relates to a 
method of inducing immunological response in a mammal which comprises, delivering 
CBMACD04 polypeptide via a vector directing expression of CBMACD04 polynucleotide in vivo 
in order to induce such an immunological response to produce antibody to protect said animal from 
10 diseases. 

Further aspect of the invention relates to an immunological/vaccine formulation 
(composition) which, when introduced into a mammalian host, induces an immunological response 
in that mammal to a CBMACD04 polypeptide wherein the composition comprises a CBMACD04 
polypeptide or CBMACD04 gene. The vaccine formulation may further comprise a suitable 

15 carrier. Since CBMACD04 polypeptide may be broken down in the stomach, it is preferably 
administered parenterally (including subcutaneous, intramuscular, intravenous, intradermal etc. 
injection). Formulations suitable for parenteral administration include aqueous and non-aqueous 
sterile injection solutions which may contain anti-oxidants, buffers, bacteriostats and solutes which 
render the formulation instonic with the blood of the recipient; and aqueous and non-aqueous sterile 

20 suspensions which may include suspending agents or thickening agents. The formulations may be 

presented in unit-dose or multi-dose containers, for example, sealed ampoules and vials and may be - 
stored in a freeze-dried condition requiring only the addition of the sterile liquid carrier immediately 
prior to use. The vaccine formulation may also include adjuvant systems for enhancing the 
immunogenicity of the formulation, such as oil-in water systems and other systems known in the 

25 art. The dosage will depend on the specific activity of the vaccine and can be readily determined by 
routine experimentation. 

Screening Assays 

The CBMACD04 polypeptide of the present invention may be employed in a screening process 
30 for compounds which activate (agonists) or inhibit activation of (antagonists, or otherwise called 

inhibitors) the CBMACD04 polypeptide of the present invention. Thus, polypeptides of the invention 
may also be used to assess identify agonist or antagonists from, for example, cells, cell-free preparations, 
chemical libraries, and natural product mixtures. These agonists or antagonists may be natural or 
modified substrates, ligands, receptors, enzymes, etc., as the case may be, of the polypeptide of the 

17 
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present invention; or may be structural or functional mimetics of the polypeptide of the present invention. 
See Coligan etal y Current Protocols in Immunology l(2):Chapter 5 (1991). 

CBMACD04 polypeptides are responsible for many biological functions, including many pathologies. 
Accordingly, it is desirous to find compounds and drugs which stimulate CBMACD04 polypeptide on 
5 the one hand and which can inhibit the function of CBMACD04 polypeptide on the other hand. In 

general, agonists are employed for therapeutic and prophylactic purposes for such conditions as cancer 
and AIDS. Antagonists may be employed for a variety of therapeutic and prophylactic purposes for 
such conditions as cancer and AIDS. 

In general, such screening procedures may involve using appropriate cells which express the 
1 0 CBMACD04 polypeptide or respond to CBMACD04 polypeptide of the present invention. Such cells 
include cells from mammals, yeast, Drosophila or E. coli. Cells which express the CBMACD04 
polypeptide (or cell membrane containing the expressed polypeptide) or respond to CBMACD04 
polypeptide are then contacted with a test compound to observe binding, or stimulation or inhibition of a 
functional response. The ability of the cells which were contacted with the candidate compounds is 
1 5 compared with the same cells which were not contacted for CBMACD04 activity. 

The assays may simply test binding of a candidate compound wherein adherence to the 
cells bearing the CBMACD04 polypeptide is detected by means of a label directly or indirectly 
associated with the candidate compound or in an assay involving competition with a labeled 
competitor. Further, these assays may test whether the candidate compound results in a signal 
20 generated by activation of the CBMACD04 polypeptide, using detection systems appropriate to the 
cells bearing the CBMACD04 polypeptide. Inhibitors of activation are generally assayed in the 
presence of a known agonist and the effect on activation by the agonist by the presence of the 
candidate compound is observed. 

Further, the assays may simply comprise the steps of mixing a candidate compound with a 
25 solution containing a CBMACD04 polypeptide to form a mixture, measuring CBMACD04 activity 
in the mixture, and comparing the CBMACD04 activity of the mixture to a standard. 

The CBMACD04 cDNA, protein and antibodies to the protein may also be used to 
configure assays for detecting the effect of added compounds on the production of CBMACD04 
mRNA and protein in cells. For example, an ELISA may be constructed for measuring secreted or 
30 cell associated levels of CBMACD04 protein using monoclonal and polyclonal antibodies by 

standard methods known in the art, and this can be used to discover agents which may inhibit or 
enhance the production of CBMACD04 (also called antagonist or agonist, respectively) from 
suitably manipulated cells or tissues. 
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The CBMACD04 protein may be used to identify membrane bound or soluble receptors, if 
any, through standard receptor binding techniques known in the art. These include, but are not 
limited to, ligand binding and crosslinking assays in which the CBMACD04 is labeled with a 
radioactive isotope (eg 1251), chemically modified (eg biotinylated), or fused to a peptide sequence 
5 suitable for detection or purification, and incubated with a source of the putative receptor (cells, 
cell membranes, cell supernatants, tissue extracts, bodily fluids). Other methods include 
biophysical techniques such as surface plasmon resonance and spectroscopy. In addition to being 
used for purification and cloning of the receptor, these binding assays can be used to identify 
agonists and antagonists of CBMACD04 which compete with the binding of CBMACD04 to its 
1 0 receptors, if any. Standard methods for conducting screening assays are well understood in the art. 

Examples of potential CBMACD04 polypeptide antagonists include antibodies or, in some 
cases, oligonucleotides or proteins which are closely related to the ligands, substrates, receptors, 
enzymes, etc., as the case may be, of the CBMACD04 polypeptide, e.g., a fragment of the ligands, 
substrates, receptors, enzymes, etc.; or small molecules which bind to the polypetide of the present 
1 5 invention but do not elicit a response, so that the activity of the polypeptide is prevented. 

Thus in another aspect, the present invention relates to a screening kit for identifying 
agonists, antagonists, ligands, receptors, substrates, enzymes, etc. for CBMACD04 polypeptides; 
or compounds which decrease or enhance the production of CBMACD04 polypeptides, which 
comprises: 

20 (a) a CBMACD04 polypeptide, preferably that of SEQ ID NO:2; 

(b) a recombinant cell expressing a CBMACD04 polypeptide, preferably that, of S EQ ID NO:2 ; 

(c) a cell membrane expressing a CBMACD04 polypeptide; preferably that of SEQ ID NO: 2; or 

(d) antibody to a CBMACD04 polypeptide, preferably that of SEQ ID NO: 2. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial 
25 component. 

Prophylactic and Therapeutic Methods 

This invention provides methods of treating abnormal conditions such as, cancer and AIDS, 
related to both an excess of and insufficient amounts of CBMACD04 polypeptide activity. 
30 lfihe activity of CBMACD04 polypeptide is in excess, several approaches are available. One 

approach comprises administering to a subject an inhibitor compound (antagonist) as hereinabove 
described along with a pharmaceutical^ acceptable carrier in an amount effective to inhibit the function 
of the CBMACD04 polypeptide, such as, for example, by blocking the binding of ligands, substrates, 
receptors, enzymes, etc., or by inhibiting a second signal, and thereby alleviating the abnormal condition. 

19 
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In another approach, soluble forms of CBMACD04 polypeptides still capable of binding the ligand, 
substrate, enzymes, receptors, etc. in competition with endogenous CBMACD04 polypeptide may 
be administered. Typical embodiments of such competitors comprise fragments of the 
CBMACD04 polypeptide. 
5 In still another approach, expression of the gene encoding endogenous CBMACD04 

polypeptide can be inhibited using expression blocking techniques. Known such techniques involve 
the use of antisense sequences, either internally generated or separately administered. See, for 
example, O'Connor, JNeurochem (1991) 56:560 in Oligodeoxvnucleotides as Antisense Inhibitors 
of Gene Expression. CRC Press, Boca Raton, FL (1988). Alternatively, oligonucleotides which 
10 form triple helices with the gene can be supplied. See, for example, Lee et al, Nucleic Acids Res 
(1979) 6:3073; Cooney et al., Science (1988) 241:456; Dervan et al. Science (1991) 25 1: 1360. 
These oligomers can be administered per se or the relevant oligomers can be expressed in vivo. 

For treating abnormal conditions related to an under-expression of CBMACD04 and its 
activity, several approaches are also available. One approach comprises administering to a subject a 
1 5 therapeutically effective amount of a compound which activates CBMACD04 polypeptide, i.e., an 
agonist as described above, in combination with a pharmaceutically acceptable carrier, to thereby 
alleviate the abnormal condition. Alternatively, gene therapy may be employed to effect the endogenous 
production of CBMACD04 by the relevant cells in the subject. For example, a polynucleotide of the 
invention may be engineered for expression in a replication defective retroviral vector, as discussed 
20 above. The retroviral expression construct may then be isolated and introduced into a packaging cell 
- transduced with a retroviral plasmid vector containing RNA encoding a polypeptide of the present , 
invention such that the packaging cell now produces infectious viral particles containing the gene of 
interest. These producer cells may be administered to a subject for engineering cells in vivo and 
expression of the polypeptide in vivo. For overview of gene therapy, see Chapter 20. Gene Therapy and 
25 other Molecular Genetic-based Therapeutic Approaches, (and references cited therein) in Human 
Molecular Genetics, T Strachan and A P Read, BIOS Scientific Publishers Ltd (1996). Another 
approach is to administer a therapeutic amount of CBMACD04 polypeptides in combination with a 
suitable pharmaceutical carrier. 

30 Formulation and Administration 

Peptides, such as the soluble form of CBMACD04 polypeptides, and agonists and antagonist 
peptides or small molecules, may be formulated in combination with a suitable pharmaceutical carrier. 
Such formulations comprise a therapeutically effective amount of the polypeptide 0 r compound, and a 
pharmaceutically acceptable carrier or excipient. Such carriers include but are not limited to, saline, 
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buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. Formulation should suit the 
mode of administration, and is well within the skill of the art. The invention further relates to 
pharmaceutical packs and kits comprising one or more containers filled with one or more of the 
ingredients of the aforementioned compositions of the invention. 
5 Polypeptides and other compounds of the present invention may be employed alone or in 

conjunction with other compounds, such as therapeutic compounds. 

Preferred forms of systemic administration of the pharmaceutical compositions include injection, 
typically by intravenous injection. Other injection routes, such as subcutaneous, intramuscular, or 
intraperitoneal, can be used. Alternative means for systemic administration include transmucosal and 
1 0 transdermal administration using penetrants such as bile salts or fiisidic acids or other detergents. In 

addition, if properly formulated in enteric or encapsulated formulations, oral administration may also be 
possible. Administration of these compounds may also be topical and/or localized, in the form of salves, 
pastes, gels and the like. 

The dosage range required depends on the choice of peptide, the route of administration, the 
1 5 nature of the formulation, the nature of the subject's condition, and the judgment of the attending 

practitioner. Suitable dosages, however, are in the range of 0.1-100 jag/kg of subject. Wide variations in 
the needed dosage, however, are to be expected in view of the variety of compounds available and the 
differing efficiencies of various routes of administration. For example, oral administration would be 
expected to require higher dosages than administration by intravenous injection. Variations in these 
20 dosage levels can be adjusted using standard empirical routines for optimization, as is well understood in 
the art. 

Polypeptides used in treatment can also be generated endogenously in the subject, in treatment 
modalities often referred to as "gene therapy" as described above. Thus, for example, cells from a 
subject may be engineered with a polynucleotide, such as a DNA or RNA, to encode a polypeptide ex 
25 vivo, and for example, by the use of a retroviral plasmid vector. The cells are then introduced into the 
subject. 

All publications, including but not limited to patents and patent applications, cited in this 
specification are herein incorporated by reference as if each individual publication were specifically 
and individually indicated to be incorporated by reference herein as though fully set forth. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: WU, JI-SHENG 
FU, GANG 
ZHANG, QING-HUA 
WANG, YA-XIN 

(ii) TITLE OF THE INVENTION: A Zinc Finger Protein 

Derived From Hematopoietic Cells 

(iii) NUMBER OF SEQUENCES : 2 

15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: RATNER & PRESTIA 

(B) STREET: P.O. BOX 980 

(C) CITY: VALLEY FORGE 
20 (D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19482 

(v) COMPUTER READABLE FORM: 
25 (A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 
'(C)' OPERATING SYSTEM : DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

30 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: TO BE ASSIGNED 

(B) FILING DATE: 

(C) CLASSIFICATION: UNKNOWN 



35 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



40 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: PRESTIA, PAUL F 

(B) REGISTRATION NUMBER: 23,031 
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(C) REFERENCE /DOCKET NUMBER: GP- 70385 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-407-0700 

(B) TELEFAX: 610-407-0701 

(C) TELEX: 846169 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1619 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


1 : 






GGGGATATGG 


CGGGGCCTTT 


GTCTCTCGCT 


GTCGCCGGAG 


TCCCAGGTCT 


GTCTTCACTG 


60 


CTCTGTGTCC 


TCTGCTCCTA 


GAGGCCCAGC 


CTCTGTGGCG 


CTGTTACCAG 


CAGTATTGGA 


120 


GATCCACAGC 


TAAGATGCCA 


GGACCCCCTA 


GAAG CCTAGA 


AATGGGACTG 


TTGACATTTA 


180 


GGGATGTGGC 


CATAGAATTC 


TCTCTGGAGG 


AGTGGCAACA 


CCTGGACATT 


GCACAGCAGA 


240 


ATTTATATAG 


AAATGTGATG 


TTAGAGAACT 


ACAGAAACCT 


GGCCTTCCTG 


GGTATTGCTG 


300 


TCTCTAAGCC 


AGACCTGATC 


ACCTGTCTGG 


AACAAGGGAA 


AGAGCCCTGG 


AATATGAAGC 1 


360 


GACATGAGAT 


GGTGGATGAA 


CCCCCAGGTA 


TGTGTCCTCA 


TTTTGCTCAA 


GACCTTTGGC f ' 


420 


CAGAGCAGGG 


CATGGAAGAT 


TCTTTTCAAA 


AAG CAATACT 


GAGAAGATAT 


GGAAAATATG 


480 


GACATGAGAA 


TTTACAGTTA 


AGAAAAGGCT 


GTAAAAGTGT 


GGATGAGTAT 


AAGGTGAACA 


540 


AAGAAGGTTA 


TAATGGACTT 


AACCAGTGTT 


TCACAACTGC 


C C AGAGC AAA 


GTATTTCAAT 


600 


GTGATAAATA 


TTTGAAAGTC 


TTCTATAAAT 


TTTTAAATTC 


AAACAGACCT 


AAGATAAGAC 


660 


ATACTGAAAA 


GAAATCTTTC 


AAATGTAAAA 


AACGTGTCAA 


ATTATTTTGC 


ATGCTTTCAC 


720 


ATAAAACCCA 


ACACAAAAGC 


ATTTATCATA 


GAGAGAAGTC 


CTACAAATGT 


AAAGAATGTG 


780 


GAAAAACCTT 


TAATTGGTCC 


TCAACCCTTA 


CTAATCATAG 


GAAAATTTAT 


ACTGAAGAGA 


840 


AACCTTACAA 


ATGTGAAGAA 


TATAACAAAT 


CTCCTAAGCA 


ACTCTC AAC C 


CTTACTACAC 


900 


ATGAAATAAT 


TCATGCTGGA 


GAGAAACTCT 


ACAAATGTGA 


AGAATGTGGC 


AAAGCTTTTA 


960 


ATCGGTCCTC 


AACTTTTACT 


AAACATAAGG 


TAATTCATAC 


TGGAGTAAAA 


CCCTACAAAT 


1020 


GTGAAGAATG 


TGGCAAAGCA 


TTTTTCTGGT 


CCTCAACCCT 


AACTAAACAT 


AAGAGAATTC 


1080 


ATACTGGAGA 


GCAACCCTAC 


AAATGGGAAA 


AATTTGGCAA 


AGCCTTTAAT 


CGGTCCTCGC 


1140 


ACCTCACCAC 


AGATAAGATA 


ACTC AT AC TG 


G AG AG AAAT C 


TTACAAGTAT 


GAATAATGTG 


1200 


CCAAAGCCTA 


AGAAAACCCT 


CAATTCTTAA 


TAGATATAAG 


ATTATTCCTA 


CTGGAGAGAA 


1260 


ACTACAAACC 


TGAGAGAGGC 


GCTAATGCTT 


TTGACAGTAC 


CTAAAACTTT 


AAAGAAAATC 


1320 


ATTCTGCTGA 


AAAATCCTAG 


AAATGTGAAG 


AATGTGAAAA 


AG C C TTTAAA 


TGATTGTCAC 


1380 
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AC TTG ATTGT AGGTAAGATA ATTCATACTG GAGAAAACTA CCAGTGTGAA CAACGTGGCC 1440 

AAGCTTCGAC AATGCTCACA CCCTATTGCA CAGGAAAGCA TTTATACTTG AGAAGAAATG 1500 

TACAAATATT GGCAAAGTAA AAAATCCATT AAC AC CTGCT CACATCTTAC TCAAAATTGT 156 0 

AGAGTTCATA GTAAATAAAA GCATTAAAAT TCAAAAAAAA AAAAAAAAAA AAAAAAAAA 1619 

5 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 53 amino acids 
10 (b) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2; 



Met Pro Gly Pro Pro Arg Ser Leu Glu Met Gly Leu Leu Thr Phe Arg 
1 5 10 15 

20 Asp Val Ala lie Glu Phe Ser Leu Glu Glu Trp Gin His Leu Asp lie 

20 25 30 

Ala Gin Gin Asn Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Arg Asn 

35 40 45 

Leu Ala Phe Leu Gly He Ala Val Ser Lys Pro Asp Leu He Thr Cys 
25 50 55 60 

Leu Glu Gin Gly Lys Glu Pro Trp Asn Met Lys Arg His Glu Met Val 
' 65 " 70 75 8*0 

Asp Glu Pro Pro Gly Met Cys Pro His Phe Ala Gin Asp Leu Trp Pro 
85 90 95 

30 Glu Gin Gly Met Glu Asp Ser Phe Gin Lys Ala He Leu Arg Arg Tyr 

100 105 HO 

Gly Lys Tyr Gly His Glu Asn Leu Gin Leu Arg Lys Gly Cys Lys Ser 

115 120 125 

Val Asp Glu Tyr Lys Val Asn Lys Glu Gly Tyr Asn Gly Leu Asn Gin 
35 130 135 140 

Cys Phe Thr Thr Ala Gin Ser Lys Val Phe Gin Cys Asp Lys Tyr Leu 
j.45 150 155 160 

Lys Val Phe Tyr Lys Phe Leu Asn Ser Asn Arg Pro Lys He Arg His 
165 170 175 

40 Thr Glu Lys Lys Ser Phe Lys Cys Lys Lys Arg Val Lys Leu Phe Cys 

180 185 190 

Met Leu Ser His Lys Thr Gin His Lys Ser He Tyr His Arg Glu Lys 
195 200 205 
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Ser Tyr Lys Cys Lys Glu Cys Gly Lys Thr Phe Asn Trp Ser Ser Thr 

210 215 220 

Leu Thr Asn His Arg Lys lie Tyr Thr Glu Glu Lys Pro Tyr Lys Cys 
225 230 235 240 

Glu Glu Tyr Asn Lys Ser Pro Lys Gin Leu Ser Thr Leu Thr Thr His 

245 250 255 

Glu lie He His Ala Gly Glu Lys Leu Tyr Lys Cys Glu Glu Cys Gly 

260 265 270 

Lys Ala Phe Asn Arg Ser Ser Thr Phe Thr Lys His Lys Val He His 

275 280 285 

Thr Gly Val Lys Pro Tyr Lys Cys Glu Glu Cys Gly Lys Ala Phe Phe 

290 295 300 

Trp Ser Ser Thr Leu Thr Lys His Lys Arg He His Thr Gly Glu Gin 
305 310 315 320 

Pro Tyr Lys Trp Glu Lys Phe Gly Lys Ala Phe Asn Arg Ser Ser His 

325 330 335 

Leu Thr Thr Asp Lys He Thr His Thr Gly Glu Lys Ser Tyr Lys Tyr 
340 345 350 

Glu 
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What is claimed is: 



1 . An isolated polynucleotide comprising a nucleotide sequence that has at least 80% 
identity over its entire length to a nucleotide sequence encoding the CBMACD04 polypeptide of SEQ ID 
NO:2; or a nucleotide sequence complementary to said isolated polynucleotide. 



2. The polynucleotide of claim 1 wherein said polynucleotide comprises the 
nucleotide sequence contained in SEQ ID NO: 1 encoding the CBMACD04 polypeptide of SEQ ID 

10 N02. 

3. The polynucleotide of claim 1 wherein said polynucleotide comprises a nucleotide 
sequence that is at least 80% identical to that of SEQ ID NO: 1 over its entire length. 

15 4. The polynucleotide of claim 3 which is polynucleotide of SEQ ID NO: 1 . 

5. The polynucleotide of claim 1 which is DNA or RNA. 

6. A DNA or RNA molecule comprising an expression system, wherein said 

20 expression system is capable of producing a CBMACD04 polypeptide comprising an amino acid 
sequence, which has at least 80% identity with the polypeptide of SEQ ID NO:2 when said 
expression system is present in a compatible host cell. 



25 



7. A host cell comprising the expression system of claim 6. 

8. A process for producing a CBMACD04 polypeptide comprising culturing a host of 
claim 7 under conditions sufficient for the production of said polypeptide and recovering the 
polypeptide from the culture. 

30 9. A process for producing a cell which produces a CBMACD04 polypeptide thereof 

comprising transforming or transfecting a host cell with the expression system of claim 6 such that 
the host cell, under appropriate culture conditions, produces a CBMACD04 polypeptide. 
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10. A CBMACD04 polypeptide comprising an amino acid sequence which is at least 
80% identical to the amino acid sequence of SEQ ID NO:2 over its entire length. 

1 1 . The polypeptide of claim 1 0 which comprises the amino acid sequence of SEQ ID 

5 NO:2. 

12. An antibody immunospecific for the CBMACD04 polypeptide of claim 10. 

13. A method for the treatment of a subject in need of enhanced activity or expression 
10 of CBMACD04 polypeptide of claim 10 comprising: 

(a) administering to the subject a therapeutically effective amount of an agonist to said 
polypeptide; and/or 

(b) providing to the subject an isolated polynucleotide comprising a nucleotide sequence 
that has at least 80% identity to a nucleotide sequence encoding the CBMACD04 polypeptide of SEQ 

1 5 ID NO:2 over its entire length; or a nucleotide sequence complementary to said nucleotide sequence in a 
form so as to effect production of said polypeptide activity in vivo. 

14. A method for the treatment of a subject having need to inhibit activity or 
expression of CBMACD04 polypeptide of claim 10 comprising: 

20 ( a ) administering to the subject a therapeutically effective amount of an antagonist to 
said polypeptide; and/or 

(b) administering to the subject a nucleic acid molecule that inhibits the expression of 
the nucleotide sequence encoding said polypeptide; and/or 

(c) administering to the subject a therapeutically effective amount of a polypeptide 
25 that competes with said polypeptide for its ligand, substrate , or receptor. 

15. A process for diagnosing a disease or a susceptibility to a disease in a subject 
related to expression or activity of CBMACD04 polypeptide of claim 10 in a subject comprising: 

(a) determining the presence or absence of a mutation in the nucleotide sequence 
30 encoding said CBMACD04 polypeptide in the genome of said subject; and/or 

(b) analyzing for the presence or amount of the CBMACD04 polypeptide expression in 
a sample derived from said subject. 
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16. A method for identifying compounds which inhibit (antagonize) or agonize the 
CBMACD04 polypeptide of claim 10 which comprises: 

(a) contacting a candidate compound with cells which express the CBMACD04 
polypeptide (or cell membrane expressing CBMACD04 polypeptide) or respond to CBMACD04 

5 polypeptide; and 

(b) observing the binding, or stimulation or inhibition of a functional response; or 
comparing the ability of the cells (or cell membrane) which were contacted with the candidate 
compounds with the same cells which were not contacted for CBMACD04 polypeptide activity. 

JO 17. An agonist identified by the method of claim 1 6 

18. An antagonist identified by the method of claim 1 6 . 

19. A recombinant host cell produced by a method of Claim 9 or a membrane thereof 
1 5 expressing a CBMACD04 polypeptide. 
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